Linear Discriminant Analysis F-Ratio for Optimization of TESPAR & MFCC Features for Speaker Recognition

Authors

K. Anitha Sheela

K. Satya Prasad

Abstract

This paper implements an efficient optimization technique for designing an Automatic Speaker Recognition (ASR) system. The technique uses the average F-ratio score of TESPAR (Time Encoded Signal Processing And Recognition) and MFCC (Mel Frequency Cepstral Coefficients) features to yield high recognition accuracy even in adverse noisy conditions. A new ranking scheme is also proposed that stabilizes the rank of features across noise levels by taking the arithmetic mean of the F-ratio scores obtained at various Signal to Noise Ratio (SNR) levels. Results are presented for a text-dependent ASR system with a 20-speaker database, using an RBF (Radial Basis Function) neural network for recognition. A comparative study of the recognition accuracies of optimized MFCC and TESPAR features shows that the proposed average F-ratio technique gives better accuracy than the simple F-ratio in noisy environments, and that TESPAR features are more redundant than MFCC features.

Index Terms: ASR, F-Ratio, Average F-Ratio, TESPAR, RBF Neural Network, MFCC.
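The ranking scheme described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the common definition of the F-ratio (variance of the per-speaker means divided by the mean of the within-speaker variances), and the function names `f_ratio` and `average_f_ratio_ranking`, the array layout, and the small stabilizing constant are all hypothetical choices made for this example.

```python
import numpy as np


def f_ratio(features, speaker_ids):
    """Per-feature F-ratio for one noise condition.

    Assumed definition: variance of the speaker means divided by the
    mean of the within-speaker variances.
    features: array of shape (n_utterances, n_features)
    speaker_ids: array of shape (n_utterances,)
    """
    features = np.asarray(features, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    speakers = np.unique(speaker_ids)
    # One row per speaker: mean and variance of each feature.
    means = np.stack([features[speaker_ids == s].mean(axis=0) for s in speakers])
    variances = np.stack([features[speaker_ids == s].var(axis=0) for s in speakers])
    between = means.var(axis=0)       # spread of speaker means
    within = variances.mean(axis=0)   # average spread inside a speaker
    return between / (within + 1e-12)  # small constant avoids division by zero


def average_f_ratio_ranking(features_by_snr, speaker_ids):
    """Average the F-ratio over SNR levels and rank features by the mean.

    features_by_snr: list of (n_utterances, n_features) arrays, one per
    SNR level, all sharing the same speaker_ids ordering (an assumption).
    Returns (mean scores, feature indices sorted from best to worst).
    """
    scores = np.stack([f_ratio(x, speaker_ids) for x in features_by_snr])
    mean_scores = scores.mean(axis=0)  # arithmetic mean across SNR levels
    ranking = np.argsort(mean_scores)[::-1]
    return mean_scores, ranking
```

Selecting the top-ranked indices from `ranking` gives a reduced feature set whose order does not depend on any single noise level, which is the stabilizing effect the abstract attributes to averaging.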


Similar resources

Forensic Speaker Verification in Noisy Environmental by Enhancing the Speech Signal Using ICA Approach

We propose a system to real environmental noise and channel mismatch for forensic speaker verification systems. This method is based on suppressing various types of real environmental noise by using independent component analysis (ICA) algorithm. The enhanced speech signal is applied to mel frequency cepstral coefficients (MFCC) or MFCC feature warping to extract the essential characteristics o...


UNIVERSITY OF WEST BOHEMIA IN PILSEN DEPARTMENT OF CYBERNETIC Optimization of Features for Robust Speaker Recognition

Currently, the old feature extraction method, which was used early for speech recognition, is used in speaker recognition in our speaker recognition group. Standard Mel Frequency Cepstral Coefficients (MFCC) features are used. They can be extended by delta and acceleration coefficients eventually. Whereas features for speech recognition have been evolved and optimized until now, features for sp...


Robust speaker identification based on perceptual log area ratio and Gaussian mixture models

This paper presents a new feature for speaker identification called perceptual log area ratio (PLAR). PLAR is closely related to the log area ratio (LAR) feature. PLAR is derived from the perceptual linear prediction (PLP) rather than the linear predictive coding (LPC). The PLAR feature derived from PLP is more robust to noise than the LAR feature. In this paper, PLAR, LAR and MFCC features wer...


Speaker Identification Based on Log Area Ratio and Gaussian Mixture Models in Narrow-Band Speech: Speech Understanding / Interaction

Log area ratio coefficients (LAR) derived from linear prediction coefficients (LPC) is a well known feature extraction technique used in speech applications. This paper presents a novel way to use the LAR feature in a speaker identification system. Here, instead of using the mel frequency cepstral coefficients (MFCC), the LAR feature is used in a Gaussian mixture model (GMM) based speaker ident...


Deep feature for text-dependent speaker verification

Recently deep learning has been successfully used in speech recognition, however it has not been carefully explored and widely accepted for speaker verification. To incorporate deep learning into speaker verification, this paper proposes novel approaches of extracting and using features from deep learning models for text-dependent speaker verification. In contrast to the traditional short-term ...



Journal title:

Journal of Multimedia

Volume 2, Issue

Pages -

Publication date 2007
